> 






/ 



tl¥.© S'®?!© 



Serial Number: _ /o/7^:^ lol 2 CRF Edit Date: 

r" IV I -r-r— - Edited by: h^iS ' 



ENTEtit: . 

Realigned nucleic acid/aiiiind acid niimbers/text 
text ‘Svrapped” to the next line 



in cases where the sequence 



Corrected the SEQ ID NO. Sequence numbers edited were: 



Inserted or corrected a nucleic number at the end of a nucleic line. SEO ID 
NO’s edited: 



deleted: invalid beginning/end-of-file text ; page numbers 



Inserted mandatory headings/numeric identifiers, specifically: 



Moved responses to same line as heading/numeric identifier, specifically: 









Revised 09/09/2003 




J 



Page 1 of 9 




RAW SEQUENCE LISTING DATE: 03/08/2004 

PATENT APPLICATION: US/10/785,673 TIME: 15:22:42 

Input Set : A:\PTO.AMC.txt 

Output Set: N:\CRF4\03082004\J785673.raw 



4 

C — > 6 

7 

8 
9 

10 

11 

13 

14 
16 
18 

19 

20 
21 
22 

23 

24 
26 

■ 27 
28 

29 

30 
32 

C— > 33 
C~> 34 

35 

41 

38 

39 

42 

43 

45 

46 

47 
49 

51 

52 

53 

54 
57 

59 

60 



SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: MINETTI , CONCEICAO; 

MICRON, FRANCIS; 

PULLEN, JEFFREY K. ; 

POLDVINO-BODNAR, MARYELLEN; 

LIANG, SHU-MEI; 

TAI, JOSEPH Y. 

(ii) TITLE OF INVENTION: MODIFIED IMMUNOGENIC 

PNEUMOLYSIN COMPOSITIONS AS VACCINES 
(ill) NUMBER OF SEQUENCES: 18 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: MORGAN & FINNEGAN, L.L.P. 

(B) STREET: 345 PARK AVENUE 

(C) CITY: NEW YORK 

(D) STATE: NEW YORK 

(E) COUNTRY: USA 

(F) ZIP: 10154 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: FLOPPY DISK 

(B) COMPUTER: IBM PC COMPATIBLE 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: MICROSOFT WORD 97 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US/10/785,673 

(B) FILING DATE: 23-Feh>-2004 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/053,306 

(B) FILING DATE: 1997-07-21 

(A) APPLICATION NUMBER: 60/073,456 

(B) FILING DATE: 1998-02-02 
(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: DARRYL H. STEENSMA 

(B) REGISTRATION NUMBER: 43,155 

(C) REFERENCE/ DOCKET NUMBER: 1758-4 036US2 
(ix) TELECOMMUNICATION INFORMATION; 

(A) TELEPHONE: (212) 758-4800 

(B) TELEFAX: (212) 751-6849 

(C) TELEX: 421792 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1413 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/lO/785 , 673 



DATE: 

TIME: 



Input Set : A:\PTO.AMC.txt 

Output Set: N:\CRF4\03082004\J785673.raw 



(B) TYPE: nucleic acid 

62 (C) STRANDEDNESS: double 

63 (D) TOPOLOGY: linear 

65 (ii) MOLECULE TYPE: cDNA 

67 (vi) ORIGINAL SOURCE: 

63 (A) ORGANISM: S. pneumoniae 

70 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

C > 72 ATGGCAAATA AAGCAGTAAA TGACTTTATA CTAGCTATGA 40 
74 ATTACGATAA AAAGAAACTC TTGACCCATC AGGGAGAAAG 80 
76 TATTGAAAAT CGTTTCATCA AAGAGGGTAA TCAGCTACCC 120 
78 GATGAGTTTG TTGTTATCGA AAGAAAGAAG CGGAGCTTGT 160 
8 0 CGACAAATAC AAGTGATATT TCTGTAACAG CTACCAACGA 2 00 
82 CAGTCGCCTC TATCCTGGAG CACTTCTCGT AGTGGATGAG 240 
84 ACCTTGTTAG AGAATAATCC CACTCTTCTT GCGGTCGATC 280 
86 GTGCTCCGAT GACTTATAGT ATTGATTTGC CTGGTTTGGC 320 
88 AAGTAGCGAT AGCTTTCTCC AAGTGGAAGA TCCCAGCAAT 360 
90 TCAAGTGTTC GCGGAGCGGT AAACGATTTG TTGGCTAAGT 400 
92 GGCATCAAGA TTATGGTCAG GTCAATAATG TCCCAGCTAG 440 
94 AATGCAGTAT GAAAAAATCA CGGCTCACAG CATGGAACAA 4 80 
97 CTCAAGGTCA AGTTTGGTTC TGACTTTGAA AAGACAGGGA 520 
99 ATTCTCTTGA TATTGATTTT AACTCTGTCC ATTCAGGCGA 560 
101 AAAGCAGATT CAGATTGTTA ATTTTAAGCA GATTTATTAT 600 
103 ACAGTCAGCG TAGACGCTGT TAAAAATCCA GGAGATGTGT 640 
105 TTCAAGATAC TGTAACGGTA GAGGATTTAA AACAGAGAGG 680 
107 AATTTCTGCA GAGCGTCCTT TGGTCTATAT TTCGAGTGTT 720 
109 GCTTATGGGC GCCAAGTCTA TCTCAAGTTG GAAACCACGA 760 
111 GTAAGAGTGA TGAAGTAGAG GCTGCTTTTG AAGCTTTGAT 8 00 
113 AAAAGGAGTC AAGGTAGCTC CTCAGACAGA GTGGAAGCAG 84 0 
115 ATTTTGGACA ATACAGAAGT GAAGGCGGTT ATTTTAGGGG 880 
117 GCGACCCAAG TTCGGGTGCC CGAGTTGTAA CAGGCAAGGT 920 
119 GGATATGGTA GAGGACTTGA TTCAAGAAGG CAGTCGCTTT 960 
121 ACAGCAGATC ATCCAGGCTT GCCGATTTCC TATACAACTT 1000 
123 CTTTTTTACG TGACAATGTA GTTGCGACCT TTCAAAATAG 1040 
125 TACAGACTAT GTTGAGACTA AGGTTACAGC TTACAGAAAC 1080 
127 GGAGATTTAC TGCTGGATCA TAGTGGTGCC TATGTTGCCC 1120 

12 9 AATATTATAT TACTTGGTU^T GAATTATCCT ATGATCATCA 1160 
131 AGGTAAGGAA GTCTTGACTC CTAAGGCTTG GGACAGAAAT 1200 
133 GGGCAGGATT TAACGGCTCA CTTTACCACT AGTATTCCTT 1240 
135 TAAAAGGGAA TGTTCGTAAT CTCTCTGTCA AAATTAGAGA 1280 
137 GTGTACCGGG CTTGCTTGGG AATGGTGGCG TACGGTTTAT 1320 

13 9 GAAAA7\ACCG ATTTGCCACT AGTGCGTAAG CGGACGATTT 13 60 
141 CTATTTGGGG AACAACTCTC TATCCGCAGG TAGAAGATAA 1400 
144 GGTAGAAAAT GAC 1413 

147 (2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1413 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



149 

150 

151 

152 

153 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/785,673 

Input Set : A:\PTO.AMC.txt 

Output Set: N:\CRF4\03082004\J785673.raw 



DATE : 
TIME: 



155 

157 

158 
160 

C — > 162 



164 

166 

168 

170 

172 

174 

176 

178 

180 

182 

184 

186 

188 

191 

193 

195 

197 

199 

201 

203 

205 

207 

209 

211 

213 

215 

217 

219 

221 

223 

225 

227 

229 

231 

233 

237 

239 

240 

241 

242 

243 
W — > 245 

247 

248 



(ii) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: S. pneumoniae 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
ATGGCAAATA AAGCAGTAAA TGACTTTATA CTAGCTATGA 40 
ATTACGATAN AAANAAACTC TTGACCCATC AGGGAGAAAG 80 
TATTGAAAAT CGTTTCANCA AAGAGGGTAA TCAGCTACCC 120 
GNTGAGTTTG TTGNTANCGA AAGAAAGAAG CGGAGCTTGT 160 
CGACAAATAC AAGTGATATT NCTGTANCAG CTACCNACGA 200 
CAGTCGCCTC TATCCTGGAG CACTTCTCGT AGTGGATGAG 240 
ACCTTGTNAG AGAATAATCC CACTCTTCTT GCGGTNGATC 280 
GTGCTCCGAT GACTTATAGT ANTGNTTTGC CTGGTTTGGC 320 
AAGTAGCGAT AGCTTTCTCC AAGTGGAAGA NCCCAGCAAT 360 
TCAAGTGTTC GCGGAGCGGN ANACGATTTG TTGGCTAAGT 400 
GGCATCAAGA TTATGGTCAG GTCAATAATG TCCCAGCTAG 440 
AANGCAGTAT GAAAAAATNA CGGCTCACAG CATGGAACAA 4 80 
CTCAAGGTCA AGTTTGGTTC TGACTTTGAA AAGNCAGGGA 520 
ATTCTCTTGA TATTGATTTT AACTCTGTCC ATTCAGGNGA 560 
AAAGCNGATT CAGATTGTTA ATNTTAAGCA GATTTATTAT 600 
ACAGTCAGCG TAGACGCTGT TAAAAATCCA GGAGATGTGT 640 
TTCAAGATAC TGTAACGGTA GAGGATTTAA AACAGAGAGG 680 
AATTTCTGCA GAGCGTCCTT TGGTCTATAT TTCGAGNGTT 720 
GCTTATGGGC GCCAAGTCTA TCTCAAGTTG GAAACCACGA 760 
GTANGAGTGN TGAAGTAGAG GCTGCTTTTG AAGCTTTGAT 800 
AAAAGGAGTC AAGGTAGCTC CTCAGACAGA GTGGAAGCAG 840 
ATTTTGGACA ATACAGAAGT GAAGGCGGTT ATTTTAGGGG 880 
GCGACCCAAG TTCGGGTGCC CGAGTTGTAA CAGGCAAGGT 920 
GGATATGGTA GAGGACTTGA TTCAAGAAGG CAGTCGCTTT 960 
ACAGCAGATC ATCCAGGCTT GCCGATTTCC TATACAACTT 1000 
CTTTTTTACG TGACAATGTA GTTGCGACCT TTCAAAANAG 1040 
TACAGACTAT GTTGAGACTA AGGTTACAGC TTACAGAAAC 108 0 
GGAGATTTAC TGCTGGATCA TAGTGGTGCC TATGTTGCCC 1120 
AATATTATAT TACTTGGNAT GAATTATCCT ATGATCATCA 1160 
AGGTAAGGAA GTCTTGACTC CTAAGGCTTG GGACAGAAAT 1200 
GGGCAGGATT TNACGGCTCA CTTTACCACT AGTATTCCTT 1240 
TAAAAGGGAA TGTTCGTAAT CTCTCTGTCA AAATTAGAGA 1280 
GTGTACCGGG CTTGCNTGGG AATGGTGGCG TACGGTTTAT 1320 
GAAAAAACCG ATTTGCCACT AGTGCGTAAG CGGACGATTT 1360 
CtaTTTGGGG AACAACTCTC TATCCNCAGG TAGANGATAA 1400 
GGTAGAAAAT GAC 1413 
(2) INFORMATION FOR SEQ ID NO; 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: amino acid 
(vi) ORIGINAL SOURCE; 

(A) ORGANISM: S. pneumoniae 



03/08/2004 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/lO/785 , 673 

Input Set : A:\PTO.AMC.txt 

Output Set: N:\CRF4\03082004\J785673.raw 



DATE; 

TIME; 



250 

252 

253 

254 

255 

256 

257 

258 

259 

260 
261 
262 

263 

264 

265 

266 

267 

268 

269 

270 

271 

272 

273 

274 

275 

276 

277 

278 

279 
281 
282 

283 

284 

285 

286 

287 

288 

289 

290 

291 

292 

293 

294 

295 

296 

297 

298 

299 

300 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Ala Asn Lys Ala Val Asn Asp Phe lie Leu Ala 

^ 5 10 

Met Asn Tyr Asp Lys Lys Lys Leu Leu Thr His Gin 

15 20 

Gly Glu Ser lie Glu Asn Arg Phe lie Lys Glu Gly 
25 30 35 

Asn Gin Leu Pro Asp Glu Phe Val Val He Glu Arg 
40 45 

Lys Lys Arg Ser Leu Ser Thr Asn Thr Ser Asp He 

55 60 

Ser Val Thr Ala Thr Asn Asp Ser Arg Leu Tyr Pro 
65 70 

Gly Ala Leu Leu Val Val Asp Glu Thr Leu Leu Glu 
75 80 

Asn Asn Pro Thr Leu Leu Ala Val Asp Arg Ala Pro 
85 90 95 

Met Thr Tyr Ser He Asp Leu Pro Gly Leu Ala Ser 
100 105 

Ser Asp Ser Phe Leu Gin Val Glu Asp Pro Ser Asn 
110 115 120 

Ser Ser Val Arg Gly Ala Val Asn Asp Leu Leu Ala 
125 130 

Lys Trp His Gin Asp Tyr Gly Gin Val Asn Asn Val 
135 140 

Pro Ala Arg Met Gin Tyr Glu Lys He Thr Ala His 
145 150 155 

Ser Met Glu Gin Leu Lys Val Lys Phe Gly Ser Asp 
160 165 

Phe Glu Lys Thr Gly Asn Ser Leu Asp He Asp Phe 
170 175 180 

Asn Ser Val His Ser Gly Glu Lys Gin He Gin He 
185 190 

Val Asn Phe Lys Gin He Tyr Tyr Thr Val Ser Val 
195 200 

Asp Ala Val Lys Asn Pro Gly Asp Val Phe Gin Asp 
205 210 215 

Thr Val Thr Val Glu Asp Leu Lys Gin Arg Gly He 
220 225 

Ser Ala Glu Arg Pro Leu Val Tyr He Ser Ser Val 
230 235 240 

Ala Tyr Gly Arg Gin Val Tyr Leu Lys Leu Glu Thr 
245 250 

Thr Ser Lys Ser Asp Glu Val Glu Ala Ala Phe Glu 
255 260 

Ala Leu He Lys Gly Val Lys Val Ala Pro Gin Thr 
265 270 275 

Glu Trp Lys Gin He Leu Asp Asn Thr Glu Val Lys 
280 285 
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RAW SEQUENCE LISTING 

PATENT APPLICATION; US/lO/785 , 673 

Input Set ; A:\PTO.AMC.txt 

Output Set: N:\CRF4\03082004\O785673.raw 



DATE: 

TIME: 



301 

302 

303 

304 

305 

306 

307 

308 

309 

310 

311 

312 

313 

314 

315 

316 

317 

318 

319 

320 

321 

322 

323 

324 

325 

326 

328 

329 

330 

331 

332 

333 
336 
338 



339 

340 

341 

342 
W — > 344 

346 

347 
349 

351 

352 
W — > 353 

354 

355 

356 

357 



Ala Val lie Leu Gly Gly Asp Pro Ser Ser Gly Ala 
290 295 3QQ 

Arg Val Val Thr Gly Lys Val Asp Met Val Glu Asp 
305 310 

Leu He Gin Glu Gly Ser Arg Phe Thr Ala Asp His 
315 320 

Pro Gly Leu Pro He Ser Tyr Thr Thr Ser Phe Leu 
325 330 335 

Arg Asp Asn Val Val Ala Thr Phe Gin Asn Ser Thr 
340 345 

Asp Tyr Val Glu Thr Lys Val Thr Ala Tyr Arg Asn 
350 355 250 

Gly Asp Leu Leu Leu Asp His Ser Gly Ala Tyr Val 
365 370 

Ala Gin Tyr Tyr lie Thr Trp Asn Glu Leu Ser Tyr 
375 330 

Asp His Gin Gly Lys Glu Val Leu Thr Pro Lys Ala 
335 390 395 

Trp Asp Arg Asn Gly Gin Asp Leu Thr Ala His Phe 
400 405 

Thr Thr Ser He Pro Leu Lys Gly Asn Val Arg Asn 
410 415 ^20 

Leu Ser Val Lys He Arg Glu Cys Thr Gly Leu Ala 
425 430 

Trp Glu Trp Trp Arg Thr Val Tyr Glu Lys Thr Asp 
435 440 

Leu Pro Leu Val Arg Lys Arg Thr He Ser He Trp 
445 450 455 

Gly Thr Thr Leu Tyr Pro Gin Val Glu Asp Lys Val 
460 465 

Glu Asn Asp 
470 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: amino acid 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: S. pneumoniae 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ala Asn Lys Ala Val Asn Asp Phe He Leu Ala 

^ 5 10 

Met Asn Tyr Asp Xaa Xaa Lys Leu Leu Thr His Gin 

15 20 

Gly Glu Ser He Glu Asn Arg Phe Xaa Lys Glu Gly 
35 30 35 

Asn Gin Leu Pro Xaa Glu Phe Val Xaa Xaa Glu Arg 
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IFWO 



9 

10 

11 

13 

14 
16 
18 

19 

20 
21 
22 

23 

24 
26 

27 

28 

29 

30 
32 

C — > 33 
C — > 34 

35 

41 

38 

39 

42 

43 

45 

46 

47 
49 

51 

52 

53 

54 



RAW SEQUENCE LISTING DATE: 03/05/2004 

PATENT APPLICATION: US/10/785,673 TIME: 12:01:54 

Input Set : A:\17584036.txt 

Output Set: N:\CRF4\03052004\J785673.raw 

SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: MINETTI, CONCEICAO; 

MICRON, FRANCIS; 

PULLEN, JEFFREY K. ; 

POLDVINO-BODNAR, MARYELLEN; 

LIANG, SHU-MEI; 

TAI, JOSEPH Y. 

(ii) TITLE OF INVENTION: MODIFIED IMMUNOGENIC 

PNEUMOLYSIN COMPOSITIONS AS VACCINES 

(111) NUMBER OF SEQUENCES: 18 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: MORGAN & FINNEGAN, L.L.P. 

(B) STREET: 345 PARK AVENUE 

(C) CITY: NEW YORK 

(D) STATE: NEW YORK 

(E) COUNTRY: USA 

(F) ZIP: 10154 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: FLOPPY DISK 

(B) COMPUTER: IBM PC COMPATIBLE 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: MICROSOFT WORD 97 f) j 

(vi) CURRENT APPLICATION DATA: ‘ 

(A) APPLICATION NUMBER: US/10/785,673 \j 

(B) FILING DATE : 23-Feb-2004 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/053,306 

(B) FILING DATE: 1997-07-21 

(A) APPLICATION NUMBER: 60/073,456 

(B) FILING DATE: 1998-02-02 
(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: DARRYL H. STEENSMA 

(B) REGISTRATION NUMBER: 43,155 

(C) REFERENCE/DOCKET NUMBER: 1758-4036US2 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 758-4800 

(B) TELEFAX: (212) 751-6849 

(C) TELEX: 421792 
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RAW SEQUENCE LISTING DATE * 

PATENT APPLICATION: US/10/785 , 673 TIME; 

Input Set ; A:\17584036.txt 

Output Set: N:\CRF4\03052004\j785673.raw 



ERRORED SEQUENCES 



336 

338 

339 

340 

341 

342 
W— > 344 

346 

347 
349 

351 

352 
W~> 353 

354 

355 

356 

357 

358 

359 

360 

361 

362 

363 

364 

365 

366 

367 

368 

369 

370 

371 

372 

373 

374 

376 

377 

378 

379 

380 
E — > 381 

382 

E — > 383 

384 

E — > 385 

386 

E — > 387 

388 

E — > 389 



E — > 



E — > 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 

(B) TYPE: amino acid 

(C) STRANDEDNESS; unknown 

(D) TOPOLOGY: linear 
(ix) MOLECULE TYPE : sunino acid. 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: S. pneumoniae 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ala Asn Lys Ala Val Asn Asp Phe lie Leu Ala 

1 5 10 

Met Asn Tyr Asp Xaa Xaa Lys Leu Leu Thr His Gin 

15 20 

Gly Glu Ser lie Glu Asn Arg Phe Xaa Lys Glu Glv 
25 30 35 

Asn Gin Leu Pro Xaa Glu Phe Val Xaa Xaa Glu Arg 
40 45 

Lys Lys Arg Ser Leu Ser Thr Asn Thr Ser Asp lie 
50 55 gQ 

Xaa Val Xaa Ala Thr Xaa Asp Ser Arg Leu Tyr Pro 
65 70 

Gly Ala Leu Leu Val Val Asp Glu Thr Xaa Leu Glu 
75 80 

Asn Asn Pro Thr Leu Leu Ala Val Asp Arg Ala Pro 
85 90 95 

Met Thr Tyr Ser Xaa Xaa Leu Pro Gly Leu Ala Ser 
100 105 

Ser Asp Ser Phe Leu Gin Val Glu Asp Pro Ser Asn 

110 115 120 

Val Arg Gly Ala Xaa Xaa Asp Leu Leu Ala 

125 130 

Lys Trp His Gin Asp Tyr Gly Gin Val Asn Asn Val 
135 140 

Pro Ala Arg Xaa Gin Tyr Glu Lys Xaa Thr Ala His y/ 

Ko IS® isS^ 1-&5- 

Ser Met Glu Gin Leu Lys Val Lys Phe Gly Ser Asp 

160 165 

Phe Glu Lys Xaa Gly Asn Ser Leu Asp lie Asp Phe 

170 175 180 

Asn Ser Val His Ser Gly Glu Lys Xaa He Gin He 

185 190 

Val Asn Xaa Lys Gin He Tyr Tyr Thr Val Ser Val 

195 200 

Asp Ala Val Lys Asn Pro Gly Asp Val Phe Gin Asp 
205 210 215 

Thr Val Thr Val Glu Asp Leu Lys Gin Arg Gly He 

220 225 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/785 , 673 



DATE: 03/05/2004 
TIME: 12:01:54 



Input Set : A:\17584036.txt 

Output Set: N:\CRF4\03052004\j785673.raw 



390 Ser Ala Glu Arg Pro 

E — > 391 230 

392 Ala Tyr Xaa Arg Gin 

E — > 393 245 

394 Thr Ser Xaa Ser Xaa 
E — > 395 255 

396 Ala Leu lie Lys Gly 
E — > 397 265 

398 Glu Trp Lys Gin lie 
E — > 399 280 

400 Ala Val lie Leu Gly 
E — > 401 290 

402 Arg Val Val Thr Gly 
E — > 403 305 

404 Leu lie Gin Glu Gly 
E — > 405 315 

406 Pro Gly Leu Pro lie 
E — > 407 325 

408 Arg Asp Asn Val Val 
E — > 409 340 

410 Asp Tyr Val Glu Thr 
E--> 411 350 

412 Gly Asp Leu Leu Leu 
E— > 413 365 

414 Ala Gin Tyr Tyr lie 
E — > 415 375 

416 Asp His Gin Gly Lys 
E — > 417 385 

418 Trp Asp Arg Asn Gly 
E — > 419 400 

420 Thr Thr Ser lie Pro 
E — > 421 410 

423 Leu Ser Val Lys lie 
E — > 424 425 

425 Trp Glu Trp Trp Arg 
E — > 426 435 

427 Leu Xaa Leu Val Arg 
E — > 428 445 

429 Gly Thr Thr Leu Tyr 
E — > 430 460 

431 Glu Asn Asp 
E — > 432 470 



Leu Val Tyr lie Ser Xaa Val 
235 240 

Val Tyr Leu Lys Leu Glu Thr 

250 

Glu Val Glu Ala Ala Phe Glu 

260 

Val Lys Val Ala Pro Gin Thr 
270 275 

Leu Asp Asn Thr Xaa Val Lys 

285 

Gly Asp Pro Ser Ser Gly Ala 
295 300 

Lys Val Asp Met Val Glu Asp 

310 

Ser Arg Phe Thr Ala Asp His 

320 

Ser Tyr Thr Thr Ser Phe Leu 

330 335 

Ala Thr Phe Gin Asn Ser Thr 

345 

Lys Val Thr Ala Tyr Arg Asn 
355 360 

Asp His Ser Gly Ala Tyr Val 

370 

Thr Trp Xaa Glu Leu Ser Tyr 

380 

Glu Val Leu Thr Pro Lys Ala 
390 395 

Gin Asp Leu Thr Ala His Phe 

405 

Leu Lys Gly Asn Val Arg Asn 
415 420 

Arg Glu Cys Thr Gly Leu Ala 

430 

Thr Val Tyr Glu Lys Thr Asp 

440 

Lys Arg Thr lie Ser lie Trp 
450 455 

Pro Gin Val Glu Asp Lys Val 

465 
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VERIFICATION SUMMARY DATE: 03/05/2004 

PATENT APPLICATION: US/10/785,673 TIME: 12:01:55 

Input Set : A:\17584036.txt 

Output Set: N: \CRF4\03052004\ J785673 . raw 



L:6 M:220 C: Keyword misspelled or invalid format, 
L:33 M:220 C: Keyword misspelled or invalid format 
L:34 M:220 C: Keyword misspelled or invalid format 
L.72 M.lll C: (47) String data converted to upper 

M:lll Repeated in SeqNo=l 

L.162 M:lll C: (47) String data converted to upper 

M:lll Repeated in SeqNo=2 



[ (i) 

, [ (A) 

/ [ (B) 

case^ 



case. 



APPLICANT: ] 
APPLICATION 
FILING DATE 



NUMBER: ] 

] 



L: 245 M: 246 W: 
L: 344 M: 246 W: 
L: 353 M: 341 W: 
M:341 Repeated 



nvalid value of Alpha Sequence Header Field, [MOLECULE TYPE:], 
Invalxd value of Alpha Sequence Header Field, [MOLECULE TYPE-f 
(46) "n" or "Xaa" used, for SEQ ID#:4 after pos.:12 

in SeqNo=4 



FoL ^=332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID: 4 

M:332 Repeated in SeqNo=4 



SeqNo=3 

SeqNo=4 



L: 443 M:246 W: 
L: 450 M: 111 C: 
L:461 M:246 W: 
L: 468 M: 111 C: 
L: 480 M: 246 W: 
L:487 M:lll C: 
L: 498 M:246 W: 
L:505 M:lll C: 
L:517 M:246 W: 
L:524 M:lll C: 
L:535 M:246 W: 
L:542 M:lll C: 
L:553 M:246 W: 
L:560 M:lll C: 
L:572 M:246 W: 
L:579 M:lll C: 
L:590 M:246 W: 
L:597 M:lll C: 
L: 609 M:246 W: 
L:616 M:lll C: 
L:626 M:246 W: 
L:633 M:lll C: 
L:644 M:246 W: 
L:651 M:lll C: 
L:663 M:246 W: 
L:670 M:lll C: 
L: 681 M:246 W: 
L:688 M:lll C: 



Invalid value of Alpha Sequence Header Field, 
(47) String data converted to upper case. 
Invalid value of Alpha Sequence Header Field, 
(47) String data converted to upper case. 
Invalid value of Alpha Sequence Header Field, 
(47) String data converted to upper case. 
Invalid value of Alpha Sequence Header Field, 
(47) String data converted to upper case. 
Invalid value of Alpha Sequence Header Field, 
(47) String data converted to upper case. 
Invalid value of Alpha Sequence Header Field, 
(47) String data converted to upper case. 
Invalid value of Alpha Sequence Header Field, 
(47) String data converted to upper case. 
Invalid value of Alpha Sequence Header Field, 
(47) String data converted to upper case. 
Invalid value of Alpha Sequence Header Field, 
(47) String data converted to upper case. 
Invalid value of Alpha Sequence Header Field, 
(47) String data converted to upper case. 
Invalid value of Alpha Sequence Header Field, 
(47) String data converted to upper case. 
Invalid value of Alpha Sequence Header Field, 
(47) String data converted to upper case. 
Invalid value of Alpha Sequence Header Field, 
(47) String data converted to upper case. 
Invalid value of Alpha Sequence Header Field, 
(47) String data converted to upper case. 



[MOLECULE TYPE : ] , SeqNo=5 
[MOLECULE TYPE:], SeqNo=6 
[MOLECULE TYPE:], SeqNo=7 
[MOLECULE TYPE:], SeqNo=8 
[MOLECULE TYPE:], SeqNo=9 
[MOLECULE TYPE:], SeqNo=10 
[MOLECULE TYPE:], SeqNo=ll 
[MOLECULE TYPE:], SeqNo=12 
[MOLECULE TYPE:], SeqNo=13 
[MOLECULE TYPE:], SeqNo=14 
[MOLECULE TYPE:], SeqNo=15 
[MOLECULE TYPE:], SeqNo=16 
[MOLECULE TYPE:], SeqNo=17 
[MOLECULE TYPE:], SeqNo=18 
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